Towards Robust Deep Neural Networks for Affect and Depression Recognition from Speech

نویسندگان

چکیده

Intelligent monitoring systems and affective computing applications have emerged in recent years to enhance healthcare. Examples of these include assessment states such as Major Depressive Disorder (MDD). MDD describes the constant expression certain emotions: negative emotions (low Valence) lack interest Arousal). High-performing intelligent would diagnosis its early stages. In this paper, we present a new deep neural network architecture, called EmoAudioNet, for emotion depression recognition from speech. Deep EmoAudioNet learns time-frequency representation audio signal visual spectrum frequencies. Our model shows very promising results predicting affect depression. It works similarly or outperforms state-of-the-art methods according several evaluation metrics on RECOLA DAIC-WOZ datasets arousal, valence, Code is publicly available GitHub: https://github.com/AliceOTHMANI/EmoAudioNet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust speech recognition with speech enhanced deep neural networks

We propose a signal pre-processing front-end to enhance speech based on deep neural networks (DNNs) and use the enhanced speech features directly to train hidden Markov models (HMMs) for robust speech recognition. As a comprehensive study, we examine its effectiveness for different acoustic features, acoustic models, and training-testing combinations. Tested on the Aurora4 task the experimental...

متن کامل

Factored Deep Convolutional Neural Networks for Noise Robust Speech Recognition

In this paper, we present a framework of a factored deep convolutional neural network (CNN) learning for noise robust automatic speech recognition (ASR). Deep CNN architecture, which has attracted great attention in various research areas, has also been successfully applied to ASR. However, to ensure noise robustness, since merely introducing deep CNN architecture into the acoustic modeling of ...

متن کامل

Binary Deep Neural Networks for Speech Recognition

Deep neural networks (DNNs) are widely used in most current automatic speech recognition (ASR) systems. To guarantee good recognition performance, DNNs usually require significant computational resources, which limits their application to low-power devices. Thus, it is appealing to reduce the computational cost while keeping the accuracy. In this work, in light of the success in image recogniti...

متن کامل

Deep segmental neural networks for speech recognition

Hybrid systems which integrate the deep neural network (DNN) and hidden Markov model (HMM) have recently achieved remarkable performance in many large vocabulary speech recognition tasks. These systems, however, remain to rely on the HMM and assume the acoustic scores for the (windowed) frames are independent given the state, suffering from the same difficulty as in the previous GMM-HMM systems...

متن کامل

Towards Robust Deep Neural Networks with BANG

Machine learning models, including state-of-the-art deep neural networks, are vulnerable to small perturbations that cause unexpected classification errors. This unexpected lack of robustness raises fundamental questions about their generalization properties and poses a serious concern for practical deployments. As such perturbations can remain imperceptible – commonly called adversarial exampl...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-68790-8_1